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Abstract. The methods of Shub and Smale [SS93] are extended to the 
class of multihomogeneous systems of polynomial equations, yielding The- 
orem 1, which is a formula expressing the mean (with respect to a partic- 
ular distribution on the space of coefficient vectors) number of real roots 
as a multiple of the mean absolute value of the determinant of a random 
matrix. Theorem 2 derives closed form expressions for the mean in special 
cases that include: (a) Shub and Smale's result that the expected num- 
ber of real roots of the general homogeneous system is the square root 
of the generic number of complex roots given by Bezout's theorem; (b) 
Rojas' [Roj96] characterization of the mean number of real roots of an 
"unmixed" multihomogeneous system. Theorem 3 gives upper and lower 
bounds for the mean number of roots, where the lower bound is the square 
root of the generic number of complex roots, as determined by Bernstein's 
[Ber75] theorem. These bounds are derived by induction from recursive 
inequalities given in Theorem 4. 
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The Expected Number of Real Roots of a 
Multihomogeneous System of Polynomial Equations 



1. Introduction 

The study of the distribution of real roots of a polynomial with ran- 
dom coefficients, which traces back at least to [BP32], has recently been 
developed in the direction of multivariate systems. (This literature is 
ably surveyed, and extended, by Edelman and Kostlan [EK95].) Kost- 
lan [Kos93] shows that, for a homogeneous polynomial equation of degree 
d in n + 1 variables, a particular inner product on the space of coefficient 
vectors is distinguished by invariance under the natural action of 0(n + 1) 
and orthogonality of monomials. He goes on to show that, for the system 
of n such equations, when the coefficient vectors for the various equations 
are independent random variables, with each one distributed according 
to the central normal distribution associated with this inner product, the 
mean number of projective roots in n-dimensional real projective space 
is d n / 2 , which is the square root of the generic number of complex roots 
given by Bezout's theorem. Shub and Smale [SS93] extend this result to 
the general homogeneous system of n homogeneous polynomial equations 
of degrees d\,...,d n , showing that the mean is y^YU di, which is again 
the square root of the Bezout number. Rojas [Roj96] studies unmixed* 1 ) 
systems of multihomogeneous equations, arriving at a closed form formula 
for the mean number of roots in the cartesian product of projective spaces 
that is the natural root space for such systems. 

This paper studies the more general case of mixed multihomogeneous 
systems. Theorem 1 is a formula expressing the mean number of real roots 
of a random multihomogeneous system as the product of the mean absolute 
value of the determinant of a random matrix times an expression composed 
of evaluations of Euler's function V at multiples of 1/2. Theorem 2, which 
is a corollary, gives a closed form formula for this mean, for a smaller class 
of systems that includes both the general homogeneous system and the 
unmixed systems as special cases, so that the results of [SS93] and [Roj96] 
described above are corollaries. Theorem 3 generalizes the "square root of 
the Bezout number" result by giving upper and lower bounds on the mean 



W Sparse systems of polynomial equations are described in Section 2. Roughly, 
such a system is unmixed if all polynomials have the same collection of monomials 
with nonzero coefficients, and otherwise it is mixed. 
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number of roots, where the lower bound is the square root of the maximal 
number of roots for the associated "demultihomogenized" system, as given 
by Bernshtein's [Ber75] extension of Bezout's theorem to sparse systems 
of polynomial equations. These bounds follow from recursive inequalities 
given in Theorem 4. 

The author's interest in this topic is motivated in part by concepts 
of noncooperative game theory^ 2 ). The concept of a totally mixed Nash 
equilibrium for a normal form game amounts to a root, all of whose com- 
ponents must be positive, of particular sort of multihomogeneous system. 
McLennan and McKelvey [MM97] give a method for constructing normal 
form games that have as many regular (real) totally mixed Nash equilib- 
ria as are permitted by Bernshtein's theorem. The conceptual import of 
this result is that the maximal number of Nash equilibria is large, at least 
compared to most game theorists' prior intuition. Games that have the 
maximal number of equilibria are thought to be very atypical, and there 
arises the question of whether the set of equilibria is not only potentially 
large, but also large on average. McLennan [McL97] investigates the ap- 
plication, to this problem, of the results developed here, using Theorem 3 
to show that the mean number of Nash equilibria can grow exponentially 
with various measures of the size of the game. Among other things, this 
analysis involves the extension of our work here to systems consisting of 
a multihomogeneous system of the sort studied here to which additional 
multihomogeneous polynomial inequalities have been appended, with the 
generalized formula being the one given here times a factor that may be 
regarded as the "probability" that a root of the system of equations also 
satisfies the inequalities. 

In connection with speculation concerning whether analogues of The- 
orem 3 might hold for more general classes of sparse systems than the 
multihomogeneous ones, we recommend [Roj] , which gives an extension to 
general sparse systems of the model of a random system studied here, and 
which presents results and conjectures along these lines. It is interesting 
to note that multihomogeneous systems are potentially special insofar as 
they can have as many real regular roots as are permitted by Bernshtein's 
theorem. (This is proved in [McL98] by pointing out that the argument 



( 2 ) This is not the place to give a general introduction to noncooperative game 
theory; Fudenberg and Tirole (1991) is a standard text. For the internal logic 
of this paper the description of quasiequilibrium (Section 2) is sufficient. For the 
connection between this notion and the standard concepts of Nash equilibrium and 
totally mixed Nash equilibrium see [MM97, McL97]. 
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in [MM97], which establishes this claim for the systems arising in game 
theory, is actually valid for any multihomogeneous system.) 

The proof of Theorem 1 parallels the analysis in [SS93] and [BCS98] 
rather closely, and is thus a descendant of the methods of [Kac43]. The 
incidence variety is the set of coefficient vector-root pairs. It is a sub- 
manifold of the cartesian product of the space of coefficient vectors and 
the root space, and the projection of it onto the root space is a fibration. 
The roots of the system at a particular coefficient vector are the preim- 
ages of the projection of the incidence variety onto the space of coefficient 
vectors, and an integral formula [SS93, p. 273; BCS98, p. 240] is used to 
reexpress the mean number of roots as a double integral, where the outer 
integral is over the root space and the inner integral is over the fibre of 
the projection onto the roots space at the root in question. Invariance 
is used to show that the inner integral does not depend on this root, so 
that the double integral is the volume of the root space times the inner 
integral, evaluated at a point in the root space which may be chosen at 
whim. For a particular choice it is possible to simplify the inner integral 
by transforming variables in a way that eliminates variables that do not 
enter the integrand, and from this Theorem 1 emerges. 

The algorithms used by [MM97] to compute maximal numbers of Nash 
equilibria are based on recursive formulas for the Bernshtein number that 
extend directly to general multihomogeneous systems. Below (see also 
[McL98]) we describe how these formulas can be seen as the consequence 
of expressing the Bernshtein number for such a system as the permanent 
(e.g. [Ego96]) of a matrix, after which the recursions are obtained by ex- 
panding along a row or column. In investigating whether the mean number 
of real roots is greater than the square root of the Bernshtein number, as 
asserted by Theorem 3, it is natural to guess that the squares of the mean 
numbers of real roots obey the corresponding recursive inequalities, which 
is the assertion of Theorem 4, since then Theorem 3 follows from induction. 
Using Theorem 1, Proposition 7.1 restates these inequalities as recursive 
inequalities for the mean absoute values of the determinants of certain ran- 
dom matrices. The proof of Proposition 7.1 is, perhaps, rather surprising 
insofar as it depends on properties of normal random variables that seem 
quite distant from the geometric starting point of these investigations. 

The remainder has the following organization. Section 2 describes 
multihomogeneous systems as a certain type of sparse system. Section 3 
specifies an inner product on the space of coefficient vectors of a multi- 
homogeneous equation that is uniquely characterized by invariance and 
orthogonality of monomials. The central normal distribution with respect 



EXPECTED NUMBER OF REAL ROOTS 



5 



to this inner product is our model of a random equation, and our random 
systems have the coefficient vectors of the various equations distributed 
independently according to these distributions. Section 4 states Theorem 

1, and in Section 5 we discuss those systems for which it is possible to 
reduce the formula in Theorem 1 either to closed form or to an expres- 
sion involving the formula applied to smaller systems. Section 6 defines 
mixed volume, states Bernshtein's theorem precisely, and shows how the 
generic number of complex roots of a multihomogeneous system may be 
computed recursively. Section 7 proves Theorems 3 and 4, and presents 
a result giving upper and lower bounds for the mean absolute value of 
the determinant of a random matrix. Sections 8-11 present the proof of 
Theorem 1. 

2. Multihomogeneous Systems 

In stating Bernshtein's theorem we will need to consider general sparse 
systems, so we describe multihomogeneous systems as a specialization of 
this concept. A sparse system of n polynomial equations in I variables is 

/(x) = (/ 1 (x),...,/ n (x)) = 0, 

where x = (xi, . . . , xg) and, for each i = 1, . . . , n, there is a nonempty 
finite Ai C N £ such that /i(x) = XlaeA /*a x ° f° r some system of coef- 
ficients f ia . (Here x a denotes the monomial f .) The general 
approach of the theory of sparse systems is to hold the n-tuple of sup- 
ports (Ai, ■ ■ ■ , A n ) fixed while treating the coefficients fi a as variables, 
for instance in the sense of studying properties that are generic in the 
space of vectors of coefficients. Such a system is said to be unmixed if 
Ai = . . . = A n ; otherwise it is mixed. Identifying a polynomial with its 
vector of coefficients, we regard Hi := M Al \ {0} as the space of polyno- 
mials with real coefficients whose supports are nonempty subsets of Ai. 
Let 

n :=n! x ... x n n . 

The system is multihomogeneous if the variables in x are divided into 
k groups, so that x = (yi,...,y fc ) where y, = (y j0 , y jU . . . , y jn .), and 
each equation is homogeneous of degree dij as a function of y J 7 for any 
given values of the other variables (yi, . . . , y^-i, Yj+i, ■ ■ ■ , Yk)- More pre- 
cisely, we require that there are nonnegative integers 5ij (i = l,...,n, 
j = 1, . . . , k) such that 

Ai = An x . . . x A ik , where Aij = { a e N nj+1 : a + «i + . . . + a nj = <% }. 
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When fi is multihomogeneous, the truth value of the proposition '/i(x) 
()' is unaffected if each block of variables is multiplied by a nonzero scalar, 
so that, in effect, there are I — k degrees of freedom. We work only with 
systems that are, in this sense, exactly determined: I = n + k, so that 
ri\ + . . . + rik = n. An instance of the type of system studied here is 
specified by the vector n and the n x k matrix 5 := (<%). 

Four particular types of multihomogeneous system figure in our dis- 
cussion: 

(a) When k = 1 we have the general homogeneous system, for which the 
problem studied here was analyzed in [SS93]. In inductive construc- 
tions it will be convenient to allow the numbers of variables in some 
blocks to be zero, and we will use the phrase 'general homogeneous 
system' to describe any multihomogeneous system with rij = n for 
some j, in which case we must have = for all h ^ j. 

(b) The unmixed multihomogeneous systems studied in [Roj96] are de- 
scribed by the condition that all equations have the same support: 
there are integers ei, . . . , such that 



(c) Generalizing (a) and (b) are the systems for which there are numbers 
d±, . . . , d n and e±, . . . , ej~ such that 6^ = diej for all % and j. 

(d) The systems arising, in game theory, from the concept of quasiequi- 
librium ([MM97]) of a finite normal form game, have, for each j = 
1, . . . , k, rij equations that are homogeneous of degree one in for 
all h 7^ j, and are homogeneous of degree zero in yj. Formally these 
systems can be characterized as follows: 



where q : {1, . . . , n} — > {1, . . . , k} is the function defined implicitly by 
the inequality 



8\j ... 5 n j 6j 



(j = l,...,k). 




if q(i) = j, 

1 otherwise, 



ni + 



. + n g(i )_i < % < m + . . . + n q (iy 
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3. An Invariant Inner Product 

Fix a pair (n, 6). Since ni + . . .+n k = n, we may index the components 
of an exponent vector a G N n+fc by the pairs (j, h) for j = 1, . . . , k and 
h = 0,...,rij. For such an a let 

aio! • . . . • a lni ! a^o' • . . . • a knk ! 

:= 7 rr • • • • • t rr 

(aio + • • • + aim)! {a k0 + . . . + a knk )} 

__ / «io + • • • + aim \ 1 / a A:0 + • • • + afcn 

V aio,...,ai ni / \ afco, • • • , a knk 

We endow each Hi with the inner product 

(fi,fl)i ■= viajfiafL- 

aEAi 

Let || • \\i be the norm derived from (•, -)^. 
Consider the product group 

G := 0(ni + 1) x ... x 0(n k + 1). 

There is the obvious component- wise action of G on M ni+1 x . . . x M nk+1 , 
and for /j G 7ij and O G G, /, o O" 1 is easily seen to be a polynomial 
function that is multihomogeneous for the same numbers Sij, so fi o O -1 
is an element of Tii. Thus the formula Ofi := fi o 0~ l defines an action 
from the left of G on 7Yj. The following generalizes [Kos93, Th. 4.2], which 
is the case k = 1. 

Lemma 3.1: The inner product (4) is the unique (up to multiplication 
by a scalar) inner product on Hi that is invariant under the action of G 
and with respect to which the monomials are pairwise orthogonal. 




Proof: Let (fi, fl)* = Y^aeAt 7 !* ( a )fiafL be an invariant inner product 
with all monomials orthogonal. We wish to show that n*(a)/r]*(a') = 
r)(a)/r)(a') for all a, a' G Ai- Fixing arbitrary a G Ai and j = 1, . . . , k, it 
suffices to establish that this formula holds for those a' G Ai with = a\ h 
whenever £ ^ j, and this follows from [Kos93, Th. 4.2] applied to the 
subspace of Hi spanned by such a'. 

To see that (•, is invariant under the action of G observe that, by 
[Kos93, Th. 4.2], it is invariant under the action of any group element g 
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with only one component gj different from the identity in 0(rij + 1), and 
that such group elements generate G. I 

Following [Kos93, EK95, Roj96], in our model of a random multi- 
homogeneous system the coefficient vectors of the various equations are 
statistically independent, with the coefficient vector of the i th equation 
centrally normally distributed in Hi relative to (■, ■){. Concretely this 
means that the coefficients fi a are independent Gaussian random vari- 
ables with mean and variance ?y(a) _1 . In the setting of arbitrary sparse 
systems [Roj96] presents a definition and motivation of these variances 
that is geometric and general, in the sense that it pertains to any sparse 
system. Let \ii be the probability measure on Hi that is the distribution 
of fi, and let 

[i : = /ii x . . . x (I n 

be the distribution of / := (/i, . . . , f n ). 

In the calculations used to prove Theorem 1 we also consider the model 
in which the coefficient vectors /i,...,/ n are statistically independent, 
with each fa uniformly distributed in the unit sphere (relative to || • ||j) 
of Hi. The distribution of roots depends only on the distribution of the 
normalized coefficient vectors fi/\\fi\\i, so standard facts concerning the 
multivariate normal distribution imply that, from our point of view, the 
two models are equivalent. 

4. The Central Formula 

We count roots in the fc-fold product of projective spaces 

P := P 1 x . . . x P k 

where, for j = l,...,k, Pj := P nj (M) is n^-dimensional real projective 
space. In the usual way, the equation fi(C) = is meaningful for fi e Hi 
and C G P even though fi is not a function defined on P. Our central 
concern is the expected number of roots 

E(n,5) : =E(#({CeP:/(C) = 0})), 

but in fact we completely characterize the distribution of roots. 

Let Z be a random n x n matrix with rows indexed by the inte- 
gers i = 1, . . . , n, columns indexed by the pairs jh for j = 1, . . . , k and 
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h = l,...,n J5 and entries z\ that are independently distributed nor- 
mal random variables with mean zero and variance Sij. Let r(s) := 
J °° exp(— t)t s_1 dt be Euler's function. 

Theorem 1: 

(a) 

E(nJ) = 2-^ ■ (n-^r) -E(|detZ|). 

j=l L \ 2 ) 

(b) The induced distribution of roots is uniform: for any open W C 
P, 

E(#({C G W : /(C) = 0})) = 



This will be proved in Sections 8-11. The next three sections describe the 
consequences of this result. 



5. Reduction to Closed Form 

In certain circumstances the RHS of (2) can be reexpressed in closed 
form or in terms of the expressions derived from application of this formula 
to systems that are, in certain senses, smaller. Insofar as = y/ir, 

r(l) = 1, and r(s + 1) = sT(s) for all s > 0, the evaluations of V in 
(2) will be regarded as being in closed form already, so the problem is to 
reduce the term E(| detZ\). 

We begin by considering systems in which there is a subset of the 
variables that are determined by equations involving only those variables. 
Specifically, suppose there is some integer k' between 1 and k such that 
dij = for all i,j such that q(i) < kl and k' < j, where q(-) is the function 
defined at the end of Section 2. Set n' := ri\ + . . . + ny. Then 



5 = 



5 11 

5 2i 5 22 



where 8 11 , 5 21 , and S 22 have dimensions n' x k', (n — n') x k', and (n — n') x 
(k — k') respectively. Then (with probability one) Z has an n' x (n — n') 
block of zeros in its upper right corner, so its determinant is the product 
of the determinants of the n' x n' submatrix in the upper left and the 
(n — n') x (n — n') submatrix in the lower right. In particular, E (| det Z\) 
does not depend on 5 21 . Consequently (2) implies that E(n,5) is also 
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independent of 5 21 . When we set 5 21 = we have a cartesian product of 
two independent systems, and our assumed distribution of coefficients for 
the combined system is the product measure of the assumed distributions 
for the subsystems. For any particular coefficient vector for the combined 
system, the number of roots is the product of the numbers of roots of 
the subsystems, so the following is a consequence of the fact that the 
mean of a product of independent random variables is the product of their 
means. Computationally, it follows immediately from the fact that the 
determinant of Z is the product of the determinants of the submatrices. 

Corollary 1: Suppose there is some 1 < k! < k such that dij = whenever 
q(i) <k'< j, and let 5 11 and 5 22 be as above. Then 



A second general principle results from the effect on the determinant 
of multiplying a row or a column by a scalar. 

Corollary 2: If there are nonnegative integers di, . . . , d n and ei, . . . , ejfc 
such that S'ij = di ■ ej ■ 5{j , then 



Consider now the particular case of k = 1 and dn = . . . = 8 n i = 1. 
This corresponds to a system of n linear functionals in n + 1 variables, and 
there is exactly one projective root for almost all coefficient vectors. In 
view of (2) we must have: 

Proposition 5.1: The mean absolute value of the determinant of a 
random n x n matrix whose entries are independently distributed normal 
random variables with mean zero and unit variance is 



E(n, 6) = E((n 1 ,...,nv), S 11 ) ■ E{{n k , +1 , . . . , n k ), S 22 ). 





Combining the last two results with Theorem 1 yields 
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Theorem 2: If there are nonnegative integers di, . . . , d n and e\, . . . , 
such that Sij = di ■ ej, then 



r(i) 



r(i) U|r(^ ^ii" 



1 



it 



The Shub-Smale formula is the special case k = 1, and Rojas' formula for 
unmixed systems is obtained when d\ = . . . = d n = 1. 

There is a class of systems for which E(n, 5) can be computed exactly 
by combining Corollaries 1 and 2 with Proposition 5.1. I know of no 
case outside this class in which the expectation E(| det Z\) evaluates to a 
closed form expression. For the systems arising from normal form games 
we are able to evaluate in closed form only when k = 2, which corresponds 
to a game with two players. Applying ideas similar to those underlying 
Corollary 1 yields: 

Corollary 3: In the case of the game equilibrium system given by (1), if 
k = 2 then 

1 otherwise. 



6. The BKK Bound for Multihomogeneous Systems 

This section explains the consequences of Bernshtein's [Ber75] theo- 
rem for multihomogeneous systems. Let /(z) = (/i(z), . . . , / n (z)) be a 
general sparse system of n equations in the n variables zi, . . . , z n , where fi 
has support A\ C N n . The Newton polytope of fi is the convex poly tope 
Qi = con(Ai). The mixed volume of Q±, . . . , Q n , which was first defined 
and studied by Minkowski, and which we denote by MV{Q\, . . . , Q n ), 
may be defined to be the coefficient of the monomial Ai • . . . • X n in the 
polynomial^ 3 ) voI(Qa) where 

Qx — AiQi + . . . + X n Qn- 

Theorem: ([Ber75]) Let C* := C \ {0}. Let H c = x ... x H% 
where H'p = C Al is the space of complex polynomials with support Ai. 

( 3 ) See [Ewa96] for a proof that vo1(C}a) is, in fact, a polynomial function of A. 
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For systems f in the complement, in Ti , of an algebraic set of positive 
(complex) codimension, there are AiV(Qi, . . . , Q n ) roots in (C*) n . 

The maximal number _MV(Qi, . . . , Q n ) of roots is often referred to as the 
"BKK bound" of the system in recognition of closely related work [Kus76, 
Kov78]. 

We apply this result to the "demultihomogenized" system obtained, 
from the given multihomogeneous system, by setting y w = . . . = y^o = 1. 
In comparing the roots of the latter system, in (C*) n , with the roots, in P, 
of the given multihomogeneous system, there is the possibility of roots in 
one of the coordinate subspaces (in the projective sense) along which one of 
the variables vanishes, but invariance under the action of G quickly implies 
that generic systems do not have such roots, or roots at projective infinity. 
Thus, generically, there is a one-to-one correspondence between the roots 
of the given multihomogeneous system and of the demultihomogenized 
system. The Newton polytope of the i th demultihomogenized equation is 
Qi = Uj: nj> oSij^(nj), where 

A(nj) := { (z ju . . . , z jn .) E iR> J :z jl + ... + z jn . < 1 }, 
and the generic number of complex roots of the system is 

BKK(n,6):=Mv( J] 5 ±J A(n J ), . . . , J] ^A(n,)). 

j:rij>0 j:rij>0 

Our analysis of this quantity employs the following concept. The 
permanent (e.g., [Ego96]) of an m x n matrix D with entries dij is 

per£>:= (II^w) 

<T£S mj „ \i=l / 

where S m)n is the set of one to one functions from {1, . . . , m} to {1, . . . , n}. 
Since there are no such functions when m > n, in which case perD is 
automatically zero. Note that multiplying any row of D by a scalar has 
the effect of multiplying the permanent by that scalar, and that we may 
expand by minors along any row: for each % = 1, . . . , m 

n 

per D = ^ dij • per D lJ 
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where D l i is the (m — 1) x {n — 1) matrix obtained from D by eliminating 
the i th row and the j th column. When m = n the permanent of D agrees 
with the permanent of its transpose, and these comments hold with rows 
and columns reversed. 

Let A(n, 5) be the n x n matrix whose first ri\ columns are the first 
column of 5, whose next n<i columns are the second column of 5, and so 
forth. The computation 



n n 

voi (s>( n <%ako))=voi( n (5>^) a k-)) 

i=l j:rij>0 j:rij>0 i=l 



nil • . . . • n k l 



has the following immediate implication: 
Proposition 6.1: ([McL98]) 

per A(n, 8) 



BKK(n, 6) 



nil ■ . . . • n k V 



The next result enumerates consequences of the elementary properties 
of the permanent, applied to this result. For i = 1, . . . , n let 8~ l be the 
{n — 1) x k matrix obtained by eliminating the i th row of 6. For j = 1, . . . , k 
let ej be the j th standard unit basis vector of M k . In the recursive formulas 
below we are adopting the convention that 

BKK((0, ...,0),S o ) = E((0,...,0),S o ) = 1, 

where 8q is the x k matrix. This means that the "null system" with no 
variables and no equations has one root. 

Proposition 6.2: ([McL98]) 

(a) For all i = 1, . . . , n, 

BKK(n, S) = 6 v • BKK ( n - e j, 

j:nj>0 

(b) For all j = 1, . . . , k such that rij > 0, 

1 n 

MK(n, 5) = — S ij ■ BKK(n - e J7 <T J ). 
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(c) Suppose there is some 1 < k' < k such that 5^ = whenever 
q(i) <k'< j, and let S 11 , S 21 , and S 22 be as in Section 4. Then 

BKK(n, 5) = BKK((n u ...,n k ,), S 11 ) ■ BKK((n k , +1 , n k ),5 22 ). 

(d) If there are nonnegative integers d±, . . . , d n and e\, . . . , e k such 
that = di ■ ej ■ 5^, then 

n 

5KfC(n,5 / )=(n^)'( II e?)-BKK(n,8). 

i=l j:rij>0 

The recursive formulas (a) and (b) give obvious algorithms for com- 
puting BKK(n, 5) that have computed values of BKK on the order of 10 21 . 
(Cf. [MM97].) 

In preparation for Theorem 3, we ask when BKK(n, 5) can be com- 
puted by repeated applications of (a) in which the RHS has only one 
nonzero term. We say that the pair (n, 6) is simply reducible if the follow- 
ing inductive definition is satisfied: there is some i for which there is at 
most one j with nj > 0, Sij > 0, and BKK(n — ej, 5~ l ) > 0, and if n > 1 
we require that for this j, (n — e^-, 5~ l ) is also simply reducible. This will 
clearly be the case when repeated applications of (c) reduces BKK(n, S) 
to a product of instances of the general homogeneous system. In fact this 
is the only way that (n, 5) can be simply reducible, as we shall see in the 
next section. 

We will need the following technical result. Let A be an m x n matrix 
of 0's and l's. We say that an m x n matrix D = (dij) is A- sparse if 
dij = whenever = 0. 

Lemma 6.3: The following conditions are equivalent: 

(i) there is an integer 1 < k < m such that, after relabelling of rows 
and columns, A has a k x (n + 1 — k) block of 0's. 

(ii) per A = 0; 

(in) all A-sparse matrices have row rank less than m. 

Proof: Clearly (i) implies (ii). The meaning of (ii) is that for each one- 
to-one a : {1, . . . , m} — > {1, . . . , n} there is some i such that a ia ^ = 0, 
which implies that all A-sparse matrices have no m x m submatrices of 
full rank, so (ii) implies (iii). Assuming that (iii) holds, we may assume 
without loss of the generality that the first k rows of A are minimally 
linearly independent: for a generic A-sparse matrix D their span agrees 
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with the span of any (k — l)-element subset. Reordering columns, we may 
assume that, for generic D, the projection of the span of the first k rows 
onto the space of the first k — 1 columns has full rank. Now the upper 
right hand k x (n — (k — 1)) block of A must vanish, since otherwise it is 
straightforward to construct an A-sparse matrix D whose first k rows are 
linearly independent. ■ 



7. The Mean Exceeds the Square Root of the Maximum 



Let Ai(n,5) be the n x n matrix whose (i, j/i)-entry is \/Sij- This 

section establishes the following generalization of the Shub-Smale formula. 

Theorem 3: 



These inequalities hold with equality when (n, S) is simply reducible and 
not otherwise. 

Theorem 3 will follow by induction from the following stronger result. 
Theorem 4: For all i = 1, . . . , n, 



£ ■ E(n - e 3 :,cT*) > E(n,S) > £ • E(n - e j7 5"*) 2 , 



These inequalities hold with equality if and only if there is at most one j 
with dij > and E(n - ej, <5 -i ) > 0. 

Proof of Theorem 3: The asserted inequalities follow from an induction 
on n that begins with the convention that £'(n, S) = BKK(n, S) = 1 when 
ri\ = . . . = rtf- = 0. The induction step is a matter of comparing (a) 
of Proposition 6.2 and the analogous formula for perAi(n, 5) with the 
inequalities in Theorem 4. Moreover, Theorem 4 implies that either of 
the inequalities in Theorem 3 holds with equality if and only if there is 
at most one j with E(n — e^, 5~ l ) > 0, and E(n — e^, 5~ l ) also satisfies 
the inequality with equality. In particular, it follows from induction that 
.E(n, 6) > if and only if BKK(n, 6) > 0, so either of the inequalities in 
Theorem 3 holds with equality if and only if (n, 5) is simply reducible. ■ 





j:nj>0 
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Remark: We can now give a direct characterization of simple reducibility. 
Applying Theorem 3 to the situation laid out in Corollary 1 and (c) of 
Proposition 6.2 shows that (n, 6) is simply reducible if and only if both 
((ni, . . . , nk'), S 11 ) and ((rik'+i, • • • , n>k), S 22 ) are simply reducible. Thus it 
suffices to characterize simple reducibility when the hypotheses of (c) of 
Proposition 6.2 are not satisfied: there is no 1 < k' < k such that (after 
any reordering of rows and columns) Sij = whenever q(i) <k'< j. The 
inequality of Theorem 3 cannot hold with equality unless all instances of 
the inequality in Theorem 4 hold with equality, so we see that if (n, S) is 
simply reducible, then for any i = 1, . . . , n there is at most one j with 
rij > 0, Sij > 0, and BKK(n — ej,5 _l ) > 0, with (n — ej,S~ l ) simply 
reducible if n > 1. If there is some i for which there exist distinct j, j' with 
rij > 0, riji > 0, Sij > 0, and 5^/ > 0, then either BKK(n — e J 7 S~ l ) = 
or BKK(n — ej>, 6~ l ) = 0, in which case Proposition 6.1 and Lemma 6.3 
imply that A(n — e J 7 5~ l ) or A(n — e^/, 5~ l ) has a block of zeros, as per (iii) 
of Lemma 6.3, and this implies that the hypotheses of (c) of Proposition 
6.2 are satisfied by (n, S), contrary to assumption. For each i there is 
consequently at most one j with rij > and S^ > 0. If there is more than 
one j with rij > it is again easy to show that the hypotheses of (c) of 
Proposition 6.2 are satisfied by (n, S), so rij = n for some j. That is, we 
have the general homogeneous case. 

It remains to prove Theorem 4. For the random matrix Z of Theorem 
1, let Z l - h be the determinant of the (n — 1) x (n — 1) minor obtained by 
eliminating row i and column jh. Observe that, by Theorem 1, 

E(l%l)=2-^(lI E T(fy i )^-e^- i ), 

so, applying Theorem 1 again to express E(n,5) in terms of E(| detZ|), 
we quickly find that the assertion of Theorem 4 is equivalent to: 

Proposition 7.1: For all i = 1, . . . , n, 



e MT7y) 2 E(i%D 2 . 

;:n,->0 ^ 2 J 

These inequalities hold with equality if and only if 5 i j~Y?i(\Z'j l \) > for at 
most one j. 



E 

n,->0 



-r(^±i) 



E (l%l) 



> 



E(|detZ|) 

7T~ 



> 
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The proof of this will be our goal for the remainder of the section. The 
next result describes the source of the inaccuracy of the approximation. 

Lemma 7.2: If x is a M> Q -valued random variable for which E(x) is 
defined, then 

m 

£E(|x h |)>E(||x||)>||E(x)||. 
h=i 

The first inequality holds with equality if and only if the support of the 
distribution of x is contained in the union of the coordinate axes. The 
second inequality holds with equality if and only if the support of the 
distribution of x is contained in a single ray emanating from the origin. 

Proof: Since J2 h E (\xh\) = E(X^ \%h\), the first inequality follows from 
J2T=i \%h\ > II ^ II 7 an d it holds with equality if and only if, with proba- 
bility one, Y^h=i \%h\ = II x II- The second inequality follows from Jensen's 
inequality, and it holds with equality if and only if ||(1 — ol)xq + a:ri|| = 
(1 — a) | l^o 1 1 + a || ^1 1| for any xq,x\ in the support of the distribution of x 
and any < a < 1. ■ 



We will need the following technical fact. 

Lemma 7.3: Let e = (ei,...,e m ) where ei,...,e m are independent 
identically distributed normal random variables with mean zero and unit 
variance. Then 



El 



v^-r(^) 
r(f) ■ 



-oo roo 

Efllein = ' 



Proof: We compute that 

/OO POO -I -I 

... INI • (^=e"^/ 2 cfai) • • • • • (-=e"^/ 2 dx m ) 
-oo J-oo V^TT V^TT 

= {2n)~ m ' 2 [ ||x||-e-^ll 2 / 2 ^ 

J R m 
poo 

= (27r)" m / 2 / (re" r2/2 ) • vol^ 771 " 1 ) • r 771 " 1 dr. 



o 



The asserted formula is now obtained from the formula (e.g., [Fed69, 
p. 251]) 

TV J.") 771 

vol(^" 1 ) = 2 T ^ (m>l) 
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the change of variables t := r 2 /2, the fact that r(|) = y/n, and the 
definition of T(-). I 



The next result expresses the central idea of the method, which ex- 
ploits a special property of random normal variables, in its simplest form. 
Random matrices have been studied extensively [Gir90, Meh91, Mui82] 
but there seems to be little prior work on mean absolute values of random 
determinants. 

Proposition 7.4: Let E be an n x n matrix whose entries e a b are 
independently distributed normal random variables with mean zero and 
variance o 2 ah . For 1 < a, 6 < n let E ab be the determinant of the (n — 1) x 
(n — 1) minor of E obtained by eliminating row a and column b. Then for 
any a = 1, . . . , n: 

. 1/2 

rpab I \ ^ 



/2A.5>«6-E(|#*|) > E (|det£|) > y/2ft. nr^-E(|^|) 

6=1 \6=1 

Proof: The expansion of the determinant by minors along row a is 

n 

detE = J2(-V a+b £abE ab . 

6=1 

For any numbers E , . . . , E an , elementary properties of Gaussian random 
variables imply that X^6=i( — ^-) a+b ^abE ab is a normally distributed random 
variable with mean and variance Ylb=i a ab(E ab ) 2 ■ Since (e a i, . . . , e an ) and 
(E al , . . . , E an ) are statistically independent, Fubini's theorem and (3) in 
the case m = 1 yield 



E(|det£|) = -^-E| 



J2°ab(E« b y 



Recalling that = y/ir, the claim follows from Lemma 7.2. I 

Let Ei be the n x n matrix with entries cx a &, and let £2 be the n x n 
matrix with entries a 2 ah . By an induction on n we now have: 
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Corollary: 

(2/yr) n / 2 • per Ei > E (| detE\) > (2/yr) n / 2 • ^/perE 2 . 



The upper and lower bounds in Lemma 7.2 correspond to the extreme 
cases in which the distribution of x is concentrated on the coordinate axes 
or on the ray through E(x). When the distribution of x is known to be 
invariant under the action of a group, it can be possible to show that it is 
far from these extremes. In the specific case we have in mind the group 

H = SO(M ni ) x ... x SO(M nk ) 

acts on the space of n x n matrices Z by simultaneously acting on each 
row of Z, where each row is viewed as an element of M ni x . . . x M nk . 
Then, because the determinant of a linear transformation between inner 
product spaces is invariant under composition with orientation preserving 
orthogonal transformations of the domain or range, we have det^Z) = 
det Z for all rj G H and all n x n matrices Z. 

Let Z % - h denote the determinant of the (n — 1) x (n — 1) minor obtained 
from Z by eliminating row i and column jh. Define the function a from 
the space ofnxti matrices to M n by letting q(Z) be the vector with 
components c^(Z) = (_i)*+»i+-+»i-i+'»2'* fc . (Of course q(Z) is inde- 
pendent of the i th row of Z, and is called the cross product (cf. [Spi65], 
pp. 84-5) of the remaining n — 1 rows.) 

Lemma 7.5: q is equivariant: Ci(rjZ) = r)Ci(Z) for all n x n matrices Z 
and all r\ E H. 

Proof: Let Zi denote the i th row of Z. Then for any rj e H we have 

Zj ■ Ci(Z) = det Z = det (rfZ) = rjZi ■ Ci(r]Z). 

Since q(Z) and Ci(rjZ) are independent of Zi, and this holds for all Zi, it 
must be the case that Ci(rjZ) = rjCi(Z) for all n x n matrices Z and all 
r] e H. I 

Proof of 7.1: As in the last proof, we write det(Z) = Zi ■ Ci(Z). As 
in the proof of Proposition 7.4, elementary properties of normal random 
variables and Fubini's theorem imply that 



E(|detZ|) = 4(Tp E ^ 



j = l h=l 
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r(|) 



Combining this with Lemma 7.2 yields 



vgjT(l) A 

m 2 j j= i 



> E(|detZ|) > 



2-r i 



r(|) \ 



For j = 1, . . . , k let IT-,- : M n — > iR nj be the projection 

n j -(z 11 ,...,z lni , ... ,z fcl ,...,z fcnfc ) = (z J ' 1 ,...,z J ' n 0- 

Clearly LTj is equivariant: Hj(rjz) = r)j(ILj(z)) for all z G iR n and 77 = 
(771, . . . , r]k) € H. Therefore IXj o is equivariant. By virtue of elemen- 
tary properties of the multivariate normal, the distribution of the random 
matrix Z on the space of n x n matrices is invariant under the action of 
H, so the distribution of £y = Hj(ci(Z)) is invariant under the action of 
SO(lR n i). 

If x is any iR n ^ -valued random variable whose distribution is invariant 
under the action of 0(M nj ), the ratio E(|£/ l |)/E(||x||) must agree with 
the mean absolute value of the first component of a random vector that is 
uniformly distributed on the unit sphere in M nj . In particular, by Lemma 
7.3 we have 

E(|fe||) _ E(||e||) _ r(2£!)/T(^) 



E(|ei|) r(i)/r(i) 



. , e n . are i.i.d. normal random variables 



when e = (ei, . . . , e n .) and e\ 
with mean zero. The asserted inequality follows from substituing this into 
(5). 

With respect to conditions under which the inequalities hold strictly, 
if the vector (||£ii||, . . . , ||Cifc||) has two components that are nonzero with 
positive probability, then both inequalities in (5) hold strictly, by Lemma 
7.2, and otherwise they do not. I 



8. A Reformulation 



This and the following three sections constitute the proof of Theorem 
1. We now reformulate the problem in a framework that is more amenable 
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to calculation, and which allows the application of the methods of [SS93] 
and [BCS98]. Let Mi C Hi be the unit sphere defined by (•, and let 



As a submanifold of H, M inherits a measure corresponding to the intu- 
itive notion of volume which we denote by vol(-) or (when no confusion is 
possible) M. The uniform distribution on M is Um(-) := vol(-)/vol(M). 
The analogous notation will occur in connection with other manifolds as 
well. The roots of / EH depend only on (/i/||/i||, . . . , f n /\\fn\\), and the 
random system (/i/||/i||, . . . , / n /||/n||) is uniformly distributed in M, by 
virtue of standard facts concerning the multivariate normal distribution. 

We regard Pj as the space of unordered pairs [Q] = {(j,— Cj} of 
antipodal points in Nj, where Nj is the unit sphere in IR nj+1 . Let 



For each root [Q G P of / G H there are 2 k corresponding roots in A. 

For each % let 6i : A — > Hi be the function with components 9i a (C) '■= 
77(a) _1 C a . Let F : M x — > lR n be the evaluation map with components 



The incidence variety is V = F 1 (0). Let iz\ and 7r 2 be the projections 
from V to M and N respectively. We now have 



In preparation for the result of the next section we discuss some tech- 
nical matters. 

Lemma 8.1: Each $i is equivariant with respect to the actions of G on 
N and Hf. 6,(00 = 06,(0 f or aJJ(6JV and O G G. The image of 6, is 
contained in the unit sphere of Hi . 

Proof: We have 



(Oft, O6i(0)i = (/», 4 (C)>i = fi(C) = h(O~ l (O0) = (Oh, 9i{OQ)i. 



M := Mi x ... x M, 



N := Ai x . . . x N k . 



Fi(fX) ■■=fi(0 = (fi,e i (0)i- 




Here the first equality is the invariance established in Lemma 3.1, and the 
other three equalities are essentially matters of definition. For given ( this 
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holds for all f it so O^OQ = 09,(0. Consequently ||^(OC)ll = 11^(011 for 
all ( and O. Clearly 0*(C) is a standard basis vector of Hi if Ci, • • • > Cfc are 
all standard basis vectors in iR ni+1 , . . . , M nk+1 respectively, so the second 
claim follows from the fact that the action of G on A is transitive. I 



The equation fi(C) = means precisely that fi and 9i(() are orthog- 
onal, so for (/, C) G V w e may construe 9i(() as a tangent vector in Tf i M il 
and clearly 

BF 

^(/,C)(o,...,MC),...,o) 

is nonzero according to whether %' = i. Thus (/, Q is a regular point of F, 
and is a regular value of F, so the regular value theorem (e.g., [GP65]) 
implies: 

Lemma 8.2: V is a C°° submanifold of M x N with dimV = dimM. 
Abusing notation, we let denote both of the "fibers" 

^(OCKCMXJV and { / G M : (/, C) G } 

over a point ( e AT, with the appropriate interpretation to be inferred 
from context. For each z let V^j be the set of /j G Mj with /i(C) = 0. As 
the intersection of Mj with a hyperplane, this set is a subsphere of Mj of 
codimension one. Thus V( = x . . . x V^ >n has a simple topology that 
is independent of (, and, as one might expect: 

Lemma 8.3: 7r 2 : V — > A is a C°° fibration. 

As usual, to argue this point in detail would be a longwinded and 
mundane affair, and we shall not do so. It is, perhaps, worth mentioning 
that the "group" of the fibration may be taken to be the group G intro- 
duced in Section 3, and that a suitable atlas of coordinate functions^ 4 ) is 
given by the following maps: given Co <G A, a neighborhood W C A of 
Co, and a C°° map h : W — > G satisfying h(C)Co = C f° r an C £ let 
. Vco x W _> be given by 0(/,C) := (MO/, C)- 



( 4 ) This terminology, and the definition of "fibration" we are appealing to, are 
from [Ste51, §2]. 
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9. An Integral Formula 

Sard's theorem implies that almost all points of M are regular values 
of 7Ti, so we need only consider such points in computing the average 
number of roots. Consider a regular point (/, C) of n 1 . Since T^^qV is 
mapped surjectively onto TfM by D7Ti(f, £), the restriction of DF(f,Q 
to T^N C T^ q{M x N) must be nonsingular, else (/, C) would not be a 
regular point of F. The implicit function theorem implies that there is a 
neighborhood U C M of / for which there is a smooth 67 : U — > iV with 
= C whose graph is contained in V. The condition matrix at (/, C) 
is the matrix of DG(f) which, by the implicit function theorem, is 

(dF \~ 1 dF 
C(/,C):=-(^:(/,C)) Qj{f,0-T f M^T c N. 

This linear transformation gives a description of the way polynomial sys- 
tems / are associated with roots near (/, (). Let C*(f, () : T^N — > TfM 
be the adjoint of C(/, t). 

Proposition 9.1: ([BCS98, p. 240]) For any open U C V, 

[ #(n^(f)nU)dM= [ [ det(C(f,()C*(f,Oy 1/2 dV c dN. 
Jm JNJVynu 

Lemma 9.2: If (/, C) G is a regular point of tti, then 
det (C(/,C)C*(/,C))" 1/2 = |detD/(C)|. 



Proof: For u G T iR n and G T f M = T fl M 1 x ...x T fn M n we compute 
that 

(^(/. 00, = ( *l(C)>l, • • • , (0n, *„(C)>»), «) 

n 

= Wi ^(c)>i = (<P, {vMO, v n e n (0))- 

i=l 

This means precisely that the map v i— > (i>i#i(C)) • • • , ^n^n(C)) is the ad- 
joint |f (/, 0* of |f (/, C), and in particular |f (/, C)|f (/, 0* is the iden- 
tity on TqM 71 . Since the matrix of the adjoint of a linear transformation 
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is the transpose of the transformation's matrix, substituting the definition 
of the condition matrix leads to 

-1/2 

det(c(/,oc*(/,o)- i/2 = U^^o-'i^o-r) 

BF 

|det— (/,C)| = |det£>/(C)|. 



Combining the last two results, for any open U C V we have 

/ #( n - 1 (f))dM= [ [ \ det Df(()\dV c dN. 

Jfeiri(U) JNJv c r\U 



10. Invariance 

Combining the actions of G on the various Hi (recall Section 3) we 
obtain an action of G on H given by 

Of := (/ioO- 1 ,...,/ n oO" 1 ). 

We will exploit this symmetry to further simplify the RHS of the formula 
above. 

Each Mi is invariant under the action of G on Hi, of course, so M 
is an invariant of the action of G on H, and the restriction of this action 
to M is an action of G on M. Of course N is invariant under the usual 
action of G on n^=i fR nj+1 - Combining these actions, we derive an action 
of G on M x N given by 0(f, C) := (Of, 0(). For any O E G, f E M, and 
C E N we have Of (O0 = f o 0^(00 = /(C), so: 

Lemma 10.1: V is an invariant of the action ofGonMx N: OV = V 
for all O E G. Consequently (for either interpretation of the symbol Vq) 
0(V C ) = Voc for all ( and O. 

Proposition 10.2: The quantity f v< _ \ det Df(Q\ dV^ is independent of 



Proof: Observe that 



D(Of)(OQ = D(f o O-^iOQ = Df(C) o O" 1 
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so that | det D{Of)(OC,) \ = | detDf(()\. We now have the calculation that 

/ \detDf(0()\dVot= [ \detD(Of)(0()\dV c = f \ det Df(()\dV c . 
JVoc Jv ( Jv c 

Here the first equality is an application of the change of variables formula 
with the change of variables function an isometry, so that the Jacobean is 
identically one. The claim now follows from the fact that the action of G 
on N is transitive. ■ 

Applying this to (7), for any open W C N and any Q e N we have 
/ #(n^(f)nn^(W))dM = vol(W)- [ \ det Df(()\ dV c . 

JM Jv ( 

Clearly (b) of Theorem 1 follows directly from this. The remaining task 
is to prove (a) of that result. 

11. The Final Calculations 

Fixing ( e N, let = . . . , /c,n) be the orthogonal projection of 
/ onto the subspace of polynomial systems for which ( is a root. For each 
i, \\f{,i\\ and are statistically independent, and the normalized 

vector is uniformly distributed in V^, so 



/ Jd e t ^C)| rf ^/jnjl/c, i ll)-| d e tO (|^,..,^)(0 

n „ 

= (II E (II^II)) / |detD/(C)|dU Vc . 
i=i Jv c 

Combining this with (6) and (8), we now obtain 

vol(M) •Ili=i E (ll/c,ill) Jn 
The formula (4) for sphere volume gives 



dfi 



\ Kl \, , vol (M^ = 2 [ l. „ , vol(V> = 2 Kl l „ = 
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and Lemma 7.3 yields 

y/2 . r( dim ^ ) 
E (ll/c,*ll) = -^dM«H7"' 

Since vol(M) = vol(Mi) x . . . x vol(M n ), and similarly for N and V^, we 
now have 

^) = 2-n/2 • (n fT^r) • / i det ^c(oi^- 

j=l L \ 2 > JH 

In the further evaluation of this quantity we are free to let C be any 
convenient point in N. We will compute at Co = (eio, • • • , e^o) G N where, 
for 1 < j < k, e j0 , e^i, . . . , e jnj are the standard unit basis vectors of 
jftrij+i p or j an( j j [ et a o _ 7 o, . . . , 0) G Aj, and for each i 

let a° = (a° l7 . . . , oP ik ) G A- Since Co = for all a G Ai other than 

a°, and Co* = 1, for each % V Coji :={/;£!,: / ia o =0}. For % and j 

such that 5ij > and each h = 1, . . . ,rij, let be a° with a°- replaced 
by (Sij - 1, 0, . . . , 0, 1, 0, . . . , 0) (the '1' is component h). Then T Co N is 
spanned by the n vectors 

b jh := (0, . . . , e jh , . . . , 0) (1 < j < k, 1 < h < nj), 

and elementary calculus yields 



Dfco,i((o) h jh 



f. jh if da > 

■> ia . l J 



if S^ = 0. 



In this way we obtain a description of Df^ (Co) as an n x n matrix with 
rows indexed by /i, . . . , / n , columns indexed by the pairs (j, h), and this 
(i,jh) -entry. Recalling from Section 3 that the variance of f ia jh is Sij, we 

see that the matrix of -D/^ (Co) has the same distribution as Z. In view 
of (9) this observation completes the proof of Theorem 1. 
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